PCX 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Intematianal Bureau 




INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT (XK3PERATION TREATY (PCD 



(51) International Patent Classification ^ : 

C12N 1S«2, 1S«9, com 14/415, C12N 
5/10, AOIH 5/00 



A2 



(11) International Publication Number: 
(43) Intematloiia) Publication Date: 



WO 98/08961 

S March 1998 (05.03.98) 



(21) IntemaUonal Application Number: PCT/IB97/01037 

(22) International FDIng Date: 28 August 1997 (28.08.97) 



(30) Priority Data: 
60/024,886 



30 August 1996 (30-08.96) 



US 



(71X72) Applicants and inventors: OLSEN. Odd-Ame fNO/NOl; 
Taamvcien 16. N-1430 Aas (NO). DOAN. Danny, N.» P. 
[AU/NOl; Lyngvcicn 14B, N-1430 Aas (NO). LINNES- 
TAD. Casper [NO/NO]; Ovre Lmncstad, N-1540 Vestby 
(NO). 



(81) DG^gnated States: AU AM, AT. AU. BA, BB. BG, BR, BY, 
CA. CH. CN, CU, CZ, DE, DK, EE. ES. FI, GB, GE. GH, 
HU. IL, IS. JP. KE, KG, KP. KR. KZ. LC. LK. LR, LS. LT. 
LU, LV. MD, MG. MK. MN, MW. MX. NO. N2. PU PT, 
RO, RU, SD. SE, SG. SI, SK, SU TI, TM, TR, TT. UA, 
UG, UZ, VN, YU, ZW, ARIPO patent (GH, KE, LS. MW. 
SD, SZ, UG. ZWX Eurasian patent (AM, AZ, BY, KG. KZ. 
MD. RU. TI, TM). European patent (AT, BE, CH, DE, DK, 
ES, FI. FR. GB. GR, IE, IT. LU, MC, NL, PT, SE), OAPI 
patent (BF, BJ. CF, CG, CI, CM, GA, GN. ML. MR. NE. 
SN. TD. TG). 



Published 

Without international search report and to be republished 
upon receipt of thai report. 



(54) Tltte: ENDOSPERM AND NUCELLUS SPECIFIC GENES, PROMOTERS AND USES THEREOF 

G H 




I J 

(57) Abstract 

Isolated nucleic acid molecules are provided which encode ENDl and NUCl, cndospenn proteins. Also provided are vectois which 
are capable of expressing such nucleic acid molecules, host cells which contain such vectors, and polyp eptide s encoded by the afore- 
mentioned nucleic acids. In addition, nucleic acid molecules arc provided which comprise ENDl or NUCl promoters. 



Best Available Copy 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States paxty to ttie PCT on the front pages of pamphlets publishing intemational applications under the PCT. 



AL 


Albania 


ES 


Spain 


L5 


Letoiho 


Si 


Slovenia 


AM 


Anncnia 


n 


Finland 


LT 


LidHiania 


SK 


Slovakia 


AT 


Anstria 


FR 


ftsnoe 


LU 


IjuyCTnbourg 


SN 


Senegal 


AU 


AostnlU 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AX 


Azerbaijm 


GB 


Unked Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Hcnegovbia 


GB 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BE 


Bartwlos 
Bclgiimi 


GR 


Ghana 


MG 


Madagascar 


TJ 


Ta^isun 


BB 


GN 


Guinea 


MK 


tht fbrroer Yugoslav 


TM 


T^nkmenstan 


BF 


BinMna Paso 
Bulgaria 


GR 


Gieeee 




Rqmblic of Maoedonia 


TR 


Tlnkcy 


BG 


HU 


Hnngaiy 


ML 


MaU 


XT 


TMaidad and Tobtgo 
Iftnine 


Bl 


Benin 


IB 


beland 


MN 


Mongolia 


UA 


BR 
BY 


Brazil 


IL 


Israel 


MR 




UG 


Ugmdn 


Bclajus 


IS 


Iceland 


MW 


Malawi 


US 


United SStaiBS of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


CemnI African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


K£ 


Kenya 


NL 


NeiherUnds 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyigyzatan 


NO 


Norway 


zw 


Zinibal>we 


CI 


Cdce d'lvoire 


KP 


Democratic People** 


NZ 


New Zealand 






CM 


Caincfoon 




Republic of Kaea 


PL 


Poland 






CN 


China 


KB 


Republic of Korea 


FT 


Portugal 






CU 


Cuba 


KZ 


Kaxakitan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Safaiil^a 


RU 


Russian Pedcmicm 






DK 


Germany 


U 


Liechlattlein 


SD 


Sudan 






DK 


Dennsifc 


LK 


Sri Lanka 


SB 


Sweden 






BB 


Eitooia 


LR 


Liberia 


SG 


Singapore 







W0 98A)8961 



1 



PCT/IB97AH037 



Description 



ENDOSPERM AND NUCELLUS SPECIFIC GENES, PROMOTERS 

AND USES THEREOF 

5 

Technical Field 

The present invention relates generally to endosperm specific genes in 
plants, and more specifically, to coding regions and promoters of these genes. 

10 Background of the Invention 

In cereal grain plants, such as barley, the endosperm is the main 
storage organ, providing nutrients to the developing and to the germinating embryo. 
Altering the types or amounts of nutrients in the endosperm could improve cereal 
grains as a food source for the world. In addition, developing seeds are susceptible to 

15 diseases caused by pathogens and insects. Increasing plant defenses at this stage of 
development could improve crop yields. 

Currently, genetic engineering of plants involves transformation of the 
entire plant and generally does not allow control of expression of transgenic products 
in an organ-specific manner. In particular, elements required for expression of 

20 products early in embryo development are not available. 

In view of the problems in obtaining expression of transgenic products 
in developing embryos of plants, there is a compelling heed for improved expression. 
The present invention exploits the use of plant genes that are highly and specifically 
^pressed in developing grains to alter the storage nutrients and capabilities of disease 

25 defenses, vsdaile providing other related advantages. 

Summary of the Invention 

Within one aspect of the present invention, an isolated nucleic acid 
molecule comprising an END I promoter is provided. Within a related aspect, a 
30 NUCl promoter is provided. In yet other aspects, isolated nucleic acid molecules 
encoding a product of ENDl and NUCl are provided. 
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In another aspect, a vector comprising one of the nucleic acid 
molecules is provided. In preferred embodiments, the vector is an expression vector. 
When the nucleic acid molecule comprises the END! or NUCl promoter, the vector 
further comprises a foreign gene operably linked to the promoter. In a preferred 
5 embodiment, the foreign gene confers disease resistance. 

In other aspects, methods of producing a foreign gene product or a 
plant that expresses a foreign gene are provided. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In 
10 addition, various references are set forth below which describe in more detail certain 
procedures or compositions (e.^., plasmids, etc.), and are therefore incorporated by 
reference in their entirety. 

Brief Description of the Drawings 

15 Figure 1 shows light microgr^h depicting the morphology and 

anatomy of barley ovaries and young grains. A. Light micrograph of intact ovary at 
5 DAP. B. Light micrograph of transverse section (2 jim) through the mid-portion of 
5 DAP ovary. From the outside and in, the pericarp consists of an epidermal layer, 
several layers of parenchyma cells, two layers of photosynthetic cross cells (c) and an 

20 inner epidermis of tube cells (the crushed cell layer) (cc). Testa (t) has an outer 
epidermis of thin walled cells and an inner layer of thick walled cells, the nucellus (n) 
has an outer epidermis and several layers of inner cells which at 5 DAP are present 
only on the ventral side. In the center of the grain at the ventral side, nucellus forms 
the nucellar projection (np), which extends into the endosperm cavity. The 

25 endosperm coenocyte (c) consists of one layer of NCDs at the arboreal stage 
surrounding the large central vacuole (cv). C. Right half of transverse section of 9 
DAP grain showing yoimg cellular endosperm with anticlinal walls (a) extending to 
the center of the endosperm from both sides and periclinal walls (p) in the periphery. 
The nucellar epidermis (ne) consists of highly vacuolated cells on the outside of the 

30 endosperm. D. Scanning electron micrograph ofisolatedembryosac covered by testa 
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and nucellus. Bars represent 1 mm in A, 200 ^m in B, 100 ^m in C and 500 ^m in 
D. 

Figure 2 jnresents in situ hybridi2ation analyses of ENDl and NUCl 
transcripts in the barley endosperm coenocyte and nuceilar tissues. A. Dark-field 
5 micrograph showing the presence of ENDl transcripts in the endospemi coenocyote 
over the nuceilar projection at 6 DAP. B. Higher magnification of a selected area of 
A. C. Same as in A. The picture is a double exposure composed of a dark-field 
micrograph showing silver grains in yellow and a phase contrast micrograph 
enhancing histological details. D. Higher magnification of a selected area of C. 

10 E. Accumulation of the ENDl transcripts on the ventral side of the endosperm over 
the nuceilar projection right after endosperai cellularizatton at S DAP. F. END 1 
transcripts in modified aleurone cells and ventral starchy endospemi cells at 1 0 DAP. 
G. Dark-field microgr^h of longitudinal section of unfertilized ovary (0 DAP) 
showing the presence of NUCl transcripts in the nucellus. H. NUCl transcripts in 

IS the nucellus of transverse section of a 2 DAP grain. The picture is a double exposure 
composed of a darkfield microgr^h showing silver grains in yellow and a phase 
contrast micrograph enhancing histological details. L NUCl transcripts in the 
nuceilar epidermis and lateral part of the nuceilar projection in transverse section of a 
7 DAP grain. J. Accumulation of NUCl transcripts in the lateral elements of the 

20 nuceilar projection and in the nuceilar epidermis at 10 DAP. Bars in figures A, C, E, 
G and I represent 200 fun; B, D, and F, 80 jxm; H and J, 100 fxm. NP, nuceilar 
projection; NE, nuceilar q)idermis; P, pericarp; N, nucellus; CV, central vacuole; ES, 
endosperm synctium; V, ventral vascular strand; AP, antipodal cells; S, stigma. 

Figure 3 is a Northern blot analysis of ENDl and NUCl transcripts in 

25 developing barley grains. Poly(A)-rich RNA (100 ng per lane) of intact ovaries and 
grains firom 0 to 30 DAP were hybridized to single stranded probes of the ENDl and 
NUCl cNDAs. The lower panel shows a control hybridization with a barley histone 
H3 probe. The same blot was used for all three probes. 
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Figure 4 presents nudeotide sequences of the END 1 and NUC 1 cDN A 
clones isolated in the differential screening experiment These sequences were used 
as probes in the in siiu and Northern blot hybridizations of Fig. 2 and 3. 

5 Detailed Description of the Invention 

Prior to setting forth the invention, it may be helpful to an 
understanding thereof to set forth definitions of certain terms that will be used 
hereinafter. 

As used herein, "ENDl" and "NUCl" refer to genes that are 

10 preferentially expressed in endosperm and nucellus, respectively. The amino acid 
sequences of representative ENDl and NUCl gene products from barley (Hordeum 
vtdgare L. cv. Bomi) have been deduced and are presented in SEQ ID NOS:3-6. 
Within the context of this invention, it should be understood that ENDl and NUCl 
include wild-type protein sequence, as well as other variants (including alleles) of the 

15 native protein sequence. Briefly, such variants may result from natural 
polymorphisms or be synthesized by recombinant methodology, and differ from wild- 
type protein by one or more amino acid substitutions, insertions, deletions, or the like. 
Typically, amino acid substitutions are conservative. In the region of homology to 
the native sequence, variants should preferably have at least 90% amino acid 

20 sequence identity, and within certain embodiments, greater than 92%, 95%, or 97% 
identity. As will be appreciated by those skilled in the art, a nucleotide sequence 
encoding ENDl, NUCl or variant may differ from the native sequences presented in 
SEQ ID NOs:l and 2, due to codon degeneracies, nucleotide polymorphisms, or 
amino acid differences. 

25 As used herein, a "promoter" refers to a nucleotide sequence that 

contains elements that direct the transcription of a linked gene. At minimimi, a 
promoter contains an RNA polymerase binding site. More typically, in eukaryotes 
promoter sequences contain binding sites for other transcriptional factors that control 
the rate and timing of gene expression. Such sites include TATA box, CAAT box, 

30 POU box, API binding site, and the like. Promoter regions may also contain 
enhancer elements. 
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An ^'isolated nucleic acid molecule*" refers to a polynucleotide 
molecule in the form of a separate fragment or as a component of a larger nucleic acid 
constmct, that has been separated from its source cell (including the chromosome it 
normally resides in) at least once in a substantially pure form. Nucleic acid 
5 molecules may be comprised of a wide variety of nucleotides, including DNA, RNA, 
nucleic acid analogues, or some combination of these. 

A nucleotide molecule having "ENDl promoter activity" refers to a 
promoter region containing elements responsive to factors that control END! gene 
expression. In general, such a sequence promotes a similar expression pattern as for 
10 native ENDl {e.g., tissue specificity, developmental timing). A sequence with ENDl 
promoter activity may vary from the native sequence by base substitutions, insertions, 
and deletions, for example. Such alterations may affect the relative strength of the 
promoter, but should not affect the expression pattern. A nucleotide sequence having 
"NUCI promoter activity" is defined in similar &shion as for a sequence having 
1 5 END 1 promoter activity. 

A. END 1 AND NUC I Genes and Gene Products 

As noted above^ the present invention provides compositions relating 
to ENDl and NUCl genes, and methods for the use of the ENDl and NUCl gene 
20 products. 

Given the disclosure provided herein, the ENDl and NUCl genes, as 
well as other developmental-specific genes, including endosperm-specific and 
nucellus-specific genes, can be readily isolated from a variety of plants, including for 
example, barley (Hordeum). For example, within one embodiment, genes encoding 

25 ENDl and NUCl can be identified by differential hybridization. Briefly, in diis 
method, a cDNA library is constructed, such as in a lambda vector. The recombinant 
clones are plated and clone DNA is transferred to a solid support (e.g., nylon filters). 
Labeled probe is prepared from RNA isolated from a positive source, such as 
endosperm, and a negative source, such as pericarp. Probe is hybridized to duplicate 

30 filters and clones hybridizing with the positive, but not with negative, probe are 
identified and isolated. 
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Other methods for identifying differentially expressed genes are also 
readily available {see. for example, Sambrook et al.. Molecular Cloning: A 
Laboratory Manual^ Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
1989; Ausubel et al.. Current Protocols in Molecular Biology, Greene Publishing, 
5 1987). Briefly, the methods graerally £ail into three major categories: subtractive 
hybridization, differential screening, and random-primed amplification. 
Combinations and variadons of these methods are also widely used. For example, a 
library pre-enriched for differentially expressed transcripts by subtactive hybidization 
can be prepared {see, e.g., Sive and St John, l^ucl. Acids Res. 16: 10937, 1988) and 

10 screened as above. Random-primed PGR is also widely employed {see, e.g. Methods 
Enzymol. 254: 275, 1995; Trends Genet. II: 242, 1995; Liang and Pardee, Science 
257: 967, 1992; Welsh et al., NucL Acids Res. 20: 4965, 1992). In addition, 
variations of random-primed PCR may also be used, especially when a particular 
gene or gene femily is desired. In such a method, one of die amplification primers is 

15 an anchored oligo(dT) (oligo(dT)dN) and the other of the primers is a degenerate 
primer based upon amino acid or nucleotide sequence of a related gene. 

Alternatively, other methods may be used to obtain a nucleic acid 
molecule that encodes ENDl, NUCl, or other developmental-specific gene. For 
example, a nucleic acid molecule may be obtained from a cDNA or genomic 

20 expression library by screening with an antibody or antibodies reactive to one or more 
of these gene products {see, Sambrook, et al. supra; Ausubel, et al. supra). Or, using 
the sequence infomiation provided herein, a probe sequence can be synthesized and 
labeled, such as with a radioactive label, enzymatic label, protein label, fluorescent 
label, or the like, and hybridized to a genomic library or a cDNA library constructed 

25 in a phage, plasmid, phagemid, or other viral vector {see, for example. Sambrook et 
al., supra, Ausabel, et al., supra). DNA representing RNA or genomic nucleic acid 
sequence may also be obtained by amplification using sets of primers complementary 
to 5' and 3' sequences of the cDNA sequence, such as presented in SEQ ID NOs: 1 and 
3. For ease of cloning, restriction sites may also be incorporated into the primers. 
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Variants (including alleles) of the ENDl and NUCl gene products 
provided herein may be readily isolated from natural variants {e.g,, polymorphisms, 
mutants), synthesized or constructed. Many methods have been developed for 
generating mutants {see. generally^ Sambrook et al., supra\ Ausabel, et al., siq}rd). 

5 Briefly, preferred methods for generating a few nucleotide substitutions utilize an 
oligonucleotide that spans the base or bases to be mutated and contains the mutated 
base or bases. The oligonucleotide is hybridized to complementary single stranded 
nucleic acid and second strand synthesis is primed from the oligonucleotide. The 
double-stranded nucleic acid is prepared for transformation into host cells, typically 

10 E. coli, but alternatively, other prokaiyoles, yeast or other eukaryotes. Standard 
screening and vector growth protocols are used to identify mutant sequences and 
obtsdn high yields. 

Similarly, deletions and/or insertions of the ENDl and NUCl genes 
may be constructed by any of a variety of known methods. For example, the gene can 

15 be digested with restriction enzymes and religated such that sequence is deleted or 
religated with additional sequence such that an insertion or large substitution is made. 
Other means to generate variant sequences may be found, for example in Sambrook et 
al. {supra) and Ausubel et al. {supra). Verification of variant sequences is typically 
accomplished by restriction enzyme mapping, sequence analysis, or probe 

20 hybridization. 

B. Promoter Regions of end 1 and nucI genes 

As described above, this invention provides promoters of ENDl and 
NUCl genes and uses of the promoters in controlling foreign gene expression. 

25 Briefly, the promoter regions of ENDl and NUCl genes are initially 

identified in a genomic clone by DNA sequence analysis. The sequence from a 
cDNA is located in the genomic clone by direct DNA sequence analysis, restriction 
mapping, RNase protection, SI nuclease analysis, or other similar methods. The 
DNA sequence upstream of the transcribed region is determined. Because promoters 

30 are relatively close to transcribed sequences, identification of a promoter may be 
made by DNA sequence analysis and/or by functional analysis. 
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Identification of classical consensus promoter sequences, such as a 
TATA box and CAAT box, located adjacent to and upstream of the coding region 
indicate the presence of a promoter. In addition, other regulatory elements in Ae 
promoter region, such as an enhancer, wound inducible factor sequences and the like, 
5 that have identifiable sequences may be present Verification of a promoter by 
functional analysis is assessed by cloning the putative promoter sequence into a 
plasmid containing a reporter gene, transfecting a cell capable of utilizing the 
promoter, and detecting expression of the reporter gene. Briefly, a DNA fragment 
containing the candidate promoter sequence is inserted into a vector containing a 

10 reporter gene, such as pBI12I, which contains GUS (|J-glucuronidase gene) under 
control of the CaMV 35S promoter (Jefferson et al., EMBOJ. 6: 3901 -3907, 1 987), in 
place of the promoter. If a promoter is present on the inserted fragment, transcription 
of GUS will ensue in cells that can activate the promoter. GUS activity is detected by 
enzymatic activity on any one of the numerous, readily assayable GUS substrates. 

15 One convenient substrate is the colorometric substrate 5-bromo-4-chloro-3-indoyl- 
glucuronide sodium salt. Other substrates that give a colorometric or fluorogenic 
signal may be readily obtained and used. 

This method may be generalized to assay promoter activity of any 
candidate promoter sequence on a DNA fragment by constructing a vector containing 

20 the DNA fragment upstream of a suitable reporter gene. Reporter genes include, but 
are not limited to, GUS, luciferase, P-galactosidase, and green fluorescent protein. 
Preferably, the reporter gene has no endogenous counterpart in the host cell, or any 
endogenous gene is expressed minimally or not at all. Examples of suitable 
constructs are discussed below. 

25 The ENDl or NUCl promoter in this construct may be tested directly 

in plants, preferably in cereal plants. Briefly, for gene promoters described herein, 
the construct described above is used to transform plants, such as tobacco leaf discs, 
via Agrobacterium rwme/ac/enj-mcdiated transformation. GUS activity is then 
detected in tissue sections of transformed plants. By assaying different plant tissues, 

30 a pattern of expression is established. Typical plant tissues for assay include floral 
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regions of the plant, such as developing and germinating seed, vasculature, 
photosynthetic tissues, such as mature leaves, and stems. Alternatively, promoter 
activity may be tested in a transient expression assay in plants. In such a method, a 
construct containing the candidate promoter region is placed upstream of a reporter 
5 gene (e.^., GUS, luciferase» p-galactosidase. green fluorescent protein) and 
transfected into plant cells by bombardment {see^ GUS Protocols: Using the GUS 
gene as a reporter of gene expression^ Gallagher (ed.), Academic Press, Inc., 1992). 
Reporter activity is measured a few hours later. When using this method, the plasmid 
or DNA may be an Agrobacterium-hascd plasmid, pUC-based plasmid, or' other 

10 vector. Various plant tissues may be used, which will provide some information 
regarding cell-specificity of the promoter. Approximately 100 mg of tissue is needed 
for bombardment. 

Minimal promoter sequences and variant promoter sequences are 
constructed by standard techniques, such as deletion by restriction digestion or other 

15 nucleolytic digestion techniques. By making progressively larger deletions from the 
5' end of the initial fragment and assaying each deletion mutant for promoter activity, 
the fninimal promoter region is determined. In general, promoter activity will be 
found in the first 200 to 500 bp upstream of the transcription initiation site. In some 
cases, the promoter may extend further or less far. 

20 

C. Foreign Genes 

As discussed above, this invention provides vectors for the expression 

of foreign or heterologous genes under control of endosperm-specific or nucellus- 

specific promoters, such as the ENDl and NUCl promoter regions. Within the 
25 context of this invention, a foreign gene is any gene sequence other ENDl or NUCl, 

including for example, other proteins, antisense sequences, or ribozyme sequences. 

The foreign genes are under control of an ENDl or NUCl promoter. 

ENDl is expressed in cells at the gateway to the endosperm as early as 5 dap (days 

after pollination). NUCl is expressed in the nucellus. As such, the promoters of 
30 these genes may be used to alter the flux of various storage proteins or engineer 

disease resistance. For example, expression of storage polysaccharides prior to starch 
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deposition, specific proteins, en2ymes, or molecules that act as sinks for other 
proteins may be manipulated to occur earlier in development than nomial. This 
disease resistance may be engendered by using the ENDl and/or NUCl promoter to 
drive the expression of (1) enzymes that degrade pathogen-produced molecules (e.g., 
5 HC-toxin reductase, fiamonis in reductase); (2) enzymes that suppress pathogen 
growth (e.^., viral coat proteins, chitinase); (3) enzymes that produce substances that 
suppress pathogen growth (c.^., glucose oxidase); (4) toxin pumps that render a plant 
insensitive to pathogen-produced molecules (e.g., cercosporin pump, HC-toxin pump, 
DON pump); and (5) proteins that activate plant defenses (e.g., bacterial opsin"). The 
10 choice of the foreign gene depends in part upon the desired result. For example, 
when disease or insect resistance to a pest or pathogen, a preferred gene is specific to 
the disease or insect. 

As noted above, the present invention provides vectors capable of 
expressing genes under the control of the ENDl or NUCl promoter. In general, the 
15 vectors should be functional in plant cells, although at times, it may be preferable to 
have vectors that are fimctional in E. coli or other organisms (e.g, production of 
protein for raising antibodies, DNA sequence analysis, construction of inserts, 
obtaining quantities of nucleic acids). Vectors and procedures for cloning and 
expression in E, coli are discussed herein and, for example, in Sambrook et al. (supra) 
20 and in Ausubel et al. (supra). 

Vectors that are fimctional in plants are preferably binary plasmids 
derived fiiom Agrobacierium plasmids. Such vectors are capable of transforming 
plant cells. These vectors contain left and right border sequences that are required for 
integration into the host (plant) chromosome. At minimum, between these border 
25 sequences is the gene to be expressed under control of a promoter. In preferred 
embodiments, a selectable marker and a reporter gene are also included. For ease of 
obtaining sufficient quantities of vector, a bacterial origin tiiat allows replication in £. 
coli is preferred. 

The vector should contain a promoter sequence. Preferably, for 
30 expression of a foreign gene, the promoter is an ENDl or NUCl promoter. The 
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entire promoter region does not need to be in the vector. However, the vector should 
contain at least the minimum sequence to promote transcription of the associated 
gene. Delineation of the minimum sequence is discussed above. In general, a 
minimum promoter region is contained within the first approximately 200 bases 
5 upstream of the transcription start site, but may extend for 300-500 bases. As well, 
variants of the promoter region may be used as long as at least 1% of native promoter 
activity is retained, and more preferably, greater than 10%, 20%, or 50% of native 
promoter activity. As provided herein, variants may be the result of natural 
polymorphisms, or synthesized mutants. Variants may also be derived from highly 

10 related promoter sequences, such as ENDl orNUCl promoter sequences from other 
plants, isolated by hybridization for example. 

In certain preferred embodiments, the vector contains a reporter gene. 
The reporter gene should allow ready determination of transformation and expression. 
The GUS ((5-glucoronidase) gene is preferred (U.S. Patent No. 5,268,463). Other 

15 reporter genes, such as p-galactosidase, luciferase, green fluorescent protein, and the 
like, are also suitable in the context of this invention. Preferably, the reporter gene 
does not have an endogenous counterpart. Methods and substrates for assaying 
expression of each of these genes are well known in the art. The reporter gene should 
be under control of a promoter that is functional in plants. Such promoters include 

20 CaMV 35S promoter, mannopine synthase promoter, ubiquitin promoter and DNA J 
promoter. 

Preferably, the vector also contains a selectable marker for identifying 
transformants. The selectable marker may confer a growth advantage under 
appropriate conditions. Generally, selectable maricers are drug resistance genes, such 
25 as neomycin phosphotransferase. Other drug resistance genes are known to those in 
the art and may be readily substituted. The selectable marker has a linked promoter, 
which may be constitutive or inducible, and a termination sequence, including a 
polyadenylation signal sequence. 

A bacterial origin of replication and a selectable marker for bacteria 
30 are preferably also included in the vector. Of the various origins readily available 
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(e.g., colEI, fd phage), a colEI origin of replication is preferred. Most preferred is the 
origin from the pUC plasmids, which allow high copy number. 

A general vector sttitable for use in the present invention is based on 
pBI121 (U.S. Patent No. 5,432,081) a derivative of pBIN19. Other vectors have been 
5 described (U.S. Patent No. 4,536,475) or may be constructed based on die guidelines 
presented herein. The plasmid pB1121 contains a bacterial origin of replication, 
selectable marker, and a left and right border sequence for integration into a plant 
host chromosome. These border sequences flank two genes. One is a kanamycin 
resistance gene (neomycin phosphottansferase) driven by a nopaline synthase 
1 0 promoter and using a nopaline synthase polyadenylation site. The second gene is the 
K coll GUS gene under control of the CaMV 35S promoter, which is polyadenylated 
using a nopaline syntase polyadenylation site. The CaMV 35S promoter is excised 
by appropriate restriction digestion. For example, a Hind III/BomH I double 
digestion or a partial Sph V complete Xba I digestion will excise the CaMV promoter 
15 sequence. The ENDl or NUCl promoter is then inserted in its place. These 
promoter sequences may be obtained by amplification, automated synthesis or 
isolated from a clone. Compatible cloning restriction sites are added by standard 
methodologies, such as the addition of adaptors or linkers. The vector is transformed 
into bacteria, propagated, and/or transformed into plants. 

20 

D. Plant Transformation Methods 

As discussed above, the present invention also provides methods for 
producing a plant which expresses a foreign gene, comprising the steps of (a) 
introducing a vector as described above into an embryogenic plant cell, w4ierein the 

25 vector contams a foreign gene in an expressible fonn, and (b) producmg a plant from 
the embiyogenic plant cell, vtiierein the plant expresses the foreign gene. 

Vectors may be introduced into plant cells by any of several methods. 
For example, DNA may be introduced as a plasmid by Agrobacterium in co- 
cultivation or bombardment. Other transformation methods include electroporation, 

30 CaP04-mediated transfection, and the like. Preferably, DNA is first transfected into 
Agrobacierium and subsequently introduced into plant cells. Most preferably, the 
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infection is achieved by co-cultivation. In part, the choice of transformation methods 
depends upon the plant to be transformed. For example, monocots generally cannot 
be transformed by Agrobacterium. Thus, Agrobacterium transformation by co- 
cultivation is most ^ropriate for dicots and for mitotically active tissue. Non- 
5 mitotic dicot tissues can be efficiently infected by Agrobacterium when a projectile or 
bombardment method is utilized. Projectile methods are also generally used for 
transforming sunflowers and soybean. Bombardment is used when naked DNA, 
typically Agrobacterium or pUC-based plasmids, is used for transformation or 
transient expression. 

10 Briefly, co-cultivation is performed by first transforming 

Agrobacterium by freeze-thawing (Holsters et al., Mol. Gen. Genet. 163: 181-187, 
1978) or by otiier suitable methods (see, Ausubel, et al., supra\ Sambrook et al., 
supra). Agrobacterium containing the plasmid are grown overnight at 28'*C with 
continuous agitation in YEP medium in the presence of kanamycin (when the 

15 selectable marker is kan"^) or colony selected and purified on agar-containing 
medium. An aliquot or colony is grown to mid-log phase {e.g., OD=0.5) in medium 
containing 12.5 mM MES, 1 g/L NH4C1 and 0.3 g/L MgSO^ at pH 5.7. 
Approximately 10' cells/ml Agrobacterium is incubated with sterile leaf disks, 
protoplasts or meristematic tissue for 1 hr. The discs are then washed in sterile 

20 distilled water and cultivated on standard plant tissue culture medium containing 
kanamycin. 

For microprojectile bombardment, seeds are surface sterilized in 20% 
bleach solution with two drops of Tween 20 per 50 ml for 30 min and rinsed twice 
with distilled water. Seeds are then imbibed in distilled water for 60 min, and the 

25 cotyledons are broken off to produce a clean fiacture at the plane of the embryonic 
axis. The explants are then bisected longitudinally between the primordial leaves. 
The explants are placed cut surface up on GBA medium with mineral and vitamin 
additives, sucrose, indole-3-acetic acid, gibberellic acid and phytagar. Thirty to forty 
explants are placed in a circle at the center of a 60 mm plate and bombarded with 

30 approximately 4.7 mg of 1.8 ^m tungsten microprojectiles in 1.5 ^ll aliquots by a 
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PDS 1000® particle acceleration device. Each plate is bombarded twice through a 
150 ^xn Nytex screen placed about 2 cm above the samples. Freshly bombarded 
explants are placed in a suspension of transfomied Agrobacterium for 30 min and 
then transferred to GBA medium with the cut surfaces down for 3 days with an 18 hr 
5 light cycle. Explants are transferred to medium lacking growth regulators but 
containing drug for selection and grown for 2-5 weeks. After 1-2 weeks more 
without drug selection leaf samples from green, kanamycin-resistant shoots are 
assayed for the presence of neophosphotransferase activity. Positive shoots may be 
grafted to in vitro grown rootstock and transferred to soil. 

10 Within one embodiment of the invention, a genomic DNA sequence 

containing the ENDl or hOJCl promoter region is placed upstream from the E. coli 
p-glucuronidase gene in the plasmid pBI121 (Jefferson et al., EMBO J. 6:3901-3907, 
1987). in place of die CaMV 35S promoter. This construct is used to transform 
tobacco leaf discs via Agrobacterium tumefaciens-mcdiztcd transformation as 

15 described herein. Histochemical analysis of tissue sections is performed by adding 
the substrate 5-bromo-4-chloro-3-indoyl-glucuronide sodium salt in buffer containing 
potassium fenrocyanide. Plant tissue is incubated in this mixture at 37'*C overnight 
and P-glucuronidase activity is determined colorimctrically (Jefferson, Plant 
Molecular Biology Reporter 5:387-405, 1987). 

20 Any plant that would benefit fh>m expression of a foreign gene is 

suitable for transformation within the context of this invention. Such plants include 
barley, com, oat, wheat, soybean, canola, sunflower and the like. The method of 
transformation depends, at least in part, on whether the plant is a dicot or monocot. 

The following examples are offered by way of illustration, and not by 

25 way of limitation. 
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EXAMPLES 

EXAMPLE 1 
Isolation OF NUCl and END 1 Genes 

5 

cDNA clones of transcripts expressed in the endosperm coenocyte and 
the nucellus are isolated. Briefly, a cDNA library based on poIy(A)-rich RNA of 
S DAP intact ovaries is screened with a positive probe from isolated embryo sacs 
with appending testa and nucellus cell layers (Fig. ID) and a negative probe from the 
1 0 remaining pericarp. 

Plani Material 

Barley (Hordeum vulgare L. cv. Bomi) is grown under controlled 
environmental conditions of \5^C during 16 h light periods and 10°C during 8 hr 

15 darkness. At 5 DAP, the barley ovary is 5 mm long (Fig. I A) and consists of pericarp, 
testa, nucellus and the endosperm coenocyte at the beginning of the arboreal stage 
(Fig. IB). The pericarp has an outer epidermis, multiple layers of parenchyma cells, 
two layers of photosynthetic cross cells and an innermost loose layer of crushed cells, 
which permits mechanical separation of the pericarp from the imderlying tissues (see 

20 below). On the dorsal side of the 5 DAP grain, the nucellus is ahnost completely 
degraded, leaving only the nucellar epidermis. On the ventral side, in addidon to 
nucellus itself, the nucellar projection is the dominating tissue. At 8 DAP, when the 
endosperm has reached the cellular stage, it is enclosed in the nucellar epidermis, 
which is one cell layer thick (Fig. IC). Hand-pollinated grains are harvested at 

25 appropriate developmental stages, rapidly frozen in liquid nitrogen and stored at 
-80**C. Individual 5 DAP ovaries are thawed for manual separation of pericarp 
(negative probe) from the embryo sac with appending testa and nucellus layers 
(positive probe). After dissection, tissues are rapidly refrozen and stored at -80**C. 
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Isolation of cDNA clones 

A cDNA library of 5 DAP intact ovaries is constructed from poIy(A)- 
Miriched RNA by oligo dT-priming and insertion into EcoRl-Xhol digested XZAPH 
vector (Clonetech, Palo Alto, CA). The library consists of 1.5 x lO* independent 
5 clones, with more than 90% of the recombinant plaques containing inserts. About 
6000 plaques from this library are absorbed to E. coU BB4 cells, plated on LB plates 
and grown overnight DNA is transferred to GeneScreen membranes, and the 
membranes are denatured, neutralized, and affixed to the membranes {see, Sambrook 
et al., supra; Ausubel et «il., supra). 
10 Positive and negative probes for differential screening are prepared 

from total RNA extracted from embryo sacs with adhering testa and nucellus, and 
pericarp. Poly(A)-rich RNA is isolated from 70 ng total RNA using oligo(dT)j, 
primer linked to magnetic beads (Dynabeads) according to the manufacmrer's 
instruction (Dynal, Oslo, Norway). Poly(A>rich RNA is incubated wdth Superscript 
15 reverse transcriptase (200 units) in the presence of oligo(dT) (50ng). a mixture of 
dATP. dGTP and dTTP (1.25 mM each), dCTP (0.02 mM), ^^P a-dCTP (10 mCi/ml, 
3000 Ci/mmol; Amersham, UK) in a final volume of 30 jil buffer (20 mM Tris-HCl 
buffer, pH 8.4, 50 mM KCl, 2.5 mM MgClj 0.1 mg/ml BSA and 10 mM DTT). The 
reaction is incubated sequentially at 25°C for 10 min, 42'C for 50 min, PO'C for 5 
20 min and 4°C for 10 min followed by 20 min at 37'»C in the presence of 10 units of 
RNAseH. 

Di£ferential screening b performed by hybridizing pericarp-specific 
probe and embryo sac-spcdfic probe to duplicate filters. Prior to hybridization, filters 
are incubated for 30 min at 42»C in 0.4 M NaOH followed by 30 min at 42«'C in 0.1 x 

25 SSC/0. 1% SDS (w/v). Conditions for hybridization are 68 »C in 6x SSC, 0. 1 % (w/v) 
SDS, 5x Denhardt's solution, 1 mg/ml salmon sperm DNA and 2 x 10' cpm/ml 
"P-Iabeled single-stranded cDNA probe. Filters are washed in 6x SSC and 0.1% 
(w/v) SDS at 25"'C (2x30 min) followed by 68''C(2x30 min). Filters are then exposed 
to X-ray film at -80 "C for 16 h. Individual plaques that hybridized exclusively with 

30 the positive probe are picked and rescreened using the same probes as above. 
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Confirmed positive phages are excised and converted into Bluescript recombinants 
using R408 helper phage. 

Forty-four clones hybridizing only to the positive probe were selected 
and grouped into homology classes. Of these classes, six contain two or more clones, 
5 whereas the remaining are individual clones that do not hybridize to other positive 
clones. 

Sequence analysis 

cDNA inserts are excised from recombinant plasmids by restriction 
digestion with £coRI and Xhol and subcloned into M13 mpl8 and mpl9. The DNA 
10 sequence is determined on both strands using a Sequenase kit (U.S. Biochemical 
Corporation, OH). Nucleotide and amino acid sequence analyses are jjerformed using 
the software packages PC/GENE (IntelliGenetics, CA) and GCG (University of 
Wisconsin). 

The sequence of the partial ENDl cDNA from the differential 
15 screening experiment used as a probe in the experiments reported here is shown in 
Fig. 4A (SEQ ID NO: 1). The longest open reading frame is presented in SEQ ID 

NO:3. 

The sequence of the NUCl cDNA isolated in the differential screening 
experiment is shown in Fig. 4B (SEQ ID NO:2). The open reading fiMies are 
20 presented in SEQ ID NOS:4-6. The function of the NUCl cDNA remains unknown, 
as no homologous nucleotide or protein sequence related to NUCl is present in 
current databases. 

EXAMPLE 2 

25 Expression of END 1 and NUC 1 Transcripts 

In cereal grains, such as barley, the endosperm is the main storage 
organ providing nutrients to the developing and to the germinating embryo. 
Endosperm ontogeny begins with formation of a diploid embryo by fusion of a sperm 
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nucleus with the egg cell. Subsequently a triploid endospenn is formed via fusion of 
a spenn nucleus to the central cell nuclei. 

Pre- and post-fertilization structure and development of the 
megagametophyte in barley is fairly well known (Cass et al.. Can. J. Bot. 63: 2164, 
5 1985; EngeU, Nord. J. Bot. 9: 265, 1989; Engell, Sex Plant Reprod 7: 333. 1994). 
Briefly, barley endosperm development is divided into the syncytial (0-6 days after 
poUination (DAP), ceUuIarization (6-8 DAP), diflferentiation (8-21 DAP) and 
maturation (21-40 DAP) stages. In development, cytokinesis is uncoupled from 
kaiyokinesis in the initial stage leading to a large undifferentiated coenocytic' stage. 
10 After fertilization, three days of rapid nuclear divisions give rise to a population of 
nuclei evenly distributed in the thin layer of cytoplasm surrounding the vacuole of the 
central cell. The rate of RNA sjrntbesis increases approximately sevenfold during the 
syncytial stage (Bosnes et al.. Plant J 2: 661, 1992). indicating that endosperm 
development is driven by transcripts synthesized de novo fiom endospenn nuclei. At 
15 5 to 6 DAP, the individual domains undergo marked polarization along an axis 
perpendicular to the wall of the former central cell. At 6 DAP, the first anticlinal cell 
walls form in the interzones among nucleocytoplasmic domains. In the following 
stage, the nuclei enclosed in alveoli of anticlinal walls divide synchronously. 
Interzonal phragmoplasts then form between separating daughter nuclei, producing 
20 the first periclinal wall. Following this round of division, nuclei positioned next to 
the central vacuole give rise to the starchy endosperm through continued rounds of 
cell division at random orientation, lasting until 14 DAP (Brown et al.. Plant Cell 
(J:1241, 1994). 

Within die ovule, the megagametophyte is embedded in the nucellus. 

25 believed to be a homologue of the cryptogame megasporangium in which the number 
of reproductive cells is reduced to only one. At fertilization, the nucellus consists of 
several cell layers, of which the intermost layer(s) has begun autolysis (Norstog, Bot. 
Gazette 33: 97, 1974). During the following five days of development, all except die 
outermost nucellar epidermis cell layer disappear, presumably contributing nutrients 

30 to the endosperm and the embryo. Over the main ventral vascular strand, nucellus 
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develops into the nucellar projection* which is responsible for the phloem unloading 
of assimilates from the sporophyte (Cohrane and Duffiis, Protoplamsa 103: 361, 
1980). 

5 END! transcripts are differentially expressed in the endosperm coenocyte over the 
nucellar projection 

In transverse sections of 6 DAP grains, an ENDl probe hybridizes 
exclusively to transcripts in the endosperm in the area over the nucellar projection 
(Fig. 2A). At 6 DAP, the endosperm is at the arboreal stage, and is thickest on the 

10 ventral side (see Fig. IB). No signal is detectable in the dorsal or lateral parts of the 
coenocyte. As can be seen at higher magnification, the distribution of the ENDl 
transcript is sharply limited to the ventral part of the coenocyte (Fig. 2B). The 
distribution of silver grains for the ENDl probe is bimodal, reflecting the structure of 
the ai1>oreaI endosperm, with a denser accumulation at the top and the bottom than in 

15 the central part. No signal is observed in in situ hybridization experiments using an 
ENDl sense probe. 

In situ hybridization analysis shows that ENDl transcripts are present 
in the cellular endosperm in modified aleurone cells and a few layers of ventral 
starchy endosperm cells adjacent to the nucellar projection (Fig. 2C and D). In 

20 addition, at 8 DAP, a weak signal is detectable also at the dorsal side of the 
endosperm (Fig. 2C). At 10 DAP, a dorsal signal is not detected. 

By Northern blot analysis of poly(A)-rich RNA from whole grains, a 
single ENDl transcript of approximately 920 nucleotides is present in tissue from 5 
DAP to 30 DAP, when grain dessication is well underway (Fig. 3, upper panel). The 

25 signal is weak during the coenocytic stage, ^en the endosperm represents a very 
small fiaction of the grain. Subsequently, the signal increases considerably as the 
endosperm becomes cellular at 8 DAP. After this stage, ENDl transcripts are present 
at a constant steady state level throughout the differentiation and maturation stages, 
decreasing between 25 and 30 DAP. 
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Taken together, the data suggest that the modified aleurone cells and 
the ventral starchy endosperm cells that express END I transcripts are derived from 
the part of the endosperm coenocyte which expressed the ENDl transcript prior to 
cellularization. Apart from a weak signal in Nordiem blots containing poly(A)-rich 
5 RNA from 15 and 20 DAP embryos, which may be caused by small adhering pieces 
of endosperm tissue, no ENDl signal is not detectable in any other plant tissue 
examined, including leaf, stem, root, anthers and immature spikes. 

NUCI transcripts are differentially expressed in the nucellus, nucellar epidermis and 

10 in the nucellar projection 

In differential screening, the hybridization pattern of NUCl cDNA is 
indistinguishable from that of ENDl, giving a positive signal with probe derived 
from 5 DAP embryo sacs with adhering testa and nucellus cell layers, and no signal 
with the pericarp probe. However, in contrast to the endosperm specific ENDl clone, 

15 NUCl is detectable by Northern blot analysis of polyA-rich RNA of unfertilized 
ovules and thereafter from 1 and 6 DAP (Fig. 3, lower panel). The size of the NUCl 
transcript is approximately 860 nucleotides. Unless extremely abundant in the central 
cell and in tfie young endosperm coenocyte, a transcript of such high relative 
abundance is most likely to be located in the major sporophytic tissues of the ovule, 

20 (/. e. , testa or nucellus). 

As shown in Fig. 2E, in situ hybridization analysis confirms that 
NUCl is differentially expressed in the nucellus. No signal is detected in antipodal 
cells, the central cell, or in the pericarp of unfertilized ovaries. According to Norstog 
(Bot. Gazette 35:97-103, 1974), nucellus degradation is initiated before fertilization, 

25 progresses rapidly after fertilization, and is completed around 6 DAP. Thus, the 
signal between 1 and 6 DAP by in Northern blot analysis comes from transcripts in 
the degrading nucellus, as shown for the 2 DAP gram in Fig. 2F, Thereafter, when 
the nucellar parenchyma cells are completely degraded, the NUC 1 signal detected in 
Northern blot analysis is derived from transcripts located in the nucellar epidermis 

30 and the nucellar projection (Fig. 2G and H). In addition to nucellar tissues. Northern 
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blot analysis detects NUCl transcripts at a low steady-state level in immature and 
mature anthers, but not in developing or germinating embryos, leaves, stems, 
immature spikes or roots. 

5 In situ hybridization and microscopy 

In situ hybridization is performed as described (Aalen et al., Plant J. 5: 
385-396, 1994), except that tissue was fixed in 3.7% formaldehyde, 5% acetic acid 
and 50% ethanol, sections are cut 15-18 ^m thick, and autoradiograms are exposed 
for 6-7 weeks. 

10 

Northern blot hybridization 

PoIy(A)-rich RNA bom various grain and vegetative tissues is isolated 
using magnetic oligo(dT) beads (Dynal, Norway) (Jakobsea et al., NucL Acids Res. 
18: 3669, 1990). Approximately 100 ng of poly(A)-rich RNA from each sample is 

15 separated by 1.4% agarose gel electrophoresis and transferred onto nylon membrane 
filter (Amersham, UK) (see, Sambzook et al.. Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press. Cold Spring Harbor, NY, 1989). 
Single-stranded antisense ^^-cDNA probes are generated by amplification of cDNA 
insert using ICS and biotinylated T3 primers (Stratagene, La JoUa, CA) and 

20 biotinylated single-stranded cDNA bound to magnetic beads (Dynal, Norway) as 
template. ^^P-labeled probe is synthesized using a random primer labeling kit 
(Rediprime, Amersham) and a*^-dCTP (Amersham). Northern filters are 
hybridized with ^^P-labeled probe (1x10* cpm/ml) at 42^'C in the presence of 50% 
formamide and washed m 2x SSC and 1% SDS at 25''C (2x30 min), followed by 

25 washes in 0.2x SSC and 0.1% (w/v) SDS at Si^'C (2x30 min). Transcript sizes on the 
Northern blot are estimated using RNA molecular weight marker I from Boehringer 
Mannheim on each gel. Filters are exposed for 1 to 3 days. The histone H3 probe 
used in Fig. 3 to monitor gel loading is isolated fi*om the same cDNA library as used 
in the differential screening experiment. 
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EXAMPLES 

Isolation of Promoter Region of ENDl and NUCl Genes 

5 The promoter regions of the ENDl and NUCl genes are isolated fiom 

genomic clones. Briefly, a genonoic library is constructed in lambda Dash-2 vector 
(Stratagene. La Jolla, CA) or the EMBL3 vector and packaged in vitro. Phage 
particles are absorbed on E. coli and plated on LB plates. Following overnight 
growth at 37**C, nylon filters are dropped onto chilled plates. Filters are base treated, 

10 neutralized, and UV-treated {see, Ausubel et al., supra). Fragments of ENDl and 
NUCl are radiolabeled with ^^P-dCTP by random priming and synthesis using RTase. 
Filters are hybridized with labeled probe, washed, and exposed to X-ray film. 
Hybridizing clones are isolated and propagated. Gene inserts are mapped by 
restriction enzyme and Southern blotting. Suitable fragments are identified and 

15 subcloned. The portion of the insert containing 5* region is isolated by amplification 
using a primer corresponding to vector sequence and a primer complementary to the 
5' region of the ENDl and NUCl gene sequence. 

The longest amplified fragments are selected for further study. The 
fragment is inserted into pBS+ phagemid vector (Stratagene Cloning Systems, La 

20 Jolla, CA). Single-stranded phage are recovered after rescue with K07 helper virus. 
The phagemid DNA is subjected to sequence analysis. 

It will be appreciated that, although specific embodiments of the 
invention have been described herein for purposes of illustration, various 
modifications may be made without departing firom the spirit and scope of the 

25 invention. Accordingly, the invention is not limited except as by the appended 
claims. 
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SEQUENCE LISTING 



<1) GENERAL INFORMATION: 

5 

(i) APPLICANTS: Olsen, Odd-Arne 
Doan, Danny N.P. 
Linnestad, Casper 

10 (ii) TITLE OF INVENTION: ENDOSPERM AND NUCELLOS SPECIFIC GENES, 

PROMOTERS AND USES THEREOF 

(iii) NDMBER OF SEQUENCES: 6 

15 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREET: 6300 Coliirabia Center, 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 
20 (E) COUNTRY: USA 

(F) ZIP: 98104 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC con^atible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1-0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
30 (A) APPLICATION NUMBER: 

(B) FILING DATE: 30-AUG-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 
35 (A) NAME: Nottenburg Ph.D., Carol 

(B) REGISTRATION NUMBER: 39,317 

(C) REFERENCE/DOCKET NDMBER: 750027.402 

(ix> TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 



45 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
50 (D) TOPOLOGY: linear 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ACATTATAAG ATTCTAACAT CAAAGGCATA GAGACGGCTC TAGTGTAGCA GACACAAAAG 60 

60 CCATAGCCAT GGCGAAACTC ATGTGCTTAT GCTTCATCAT CCTCACTATT GCGGTAGTCG 120 

TGTCAGCTGG CGGATGCGAC GGTGATCGAG AAGACATGAT CAGGGAGTGT GGTAAGTATC 180 
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AGAAATTCCC AGCAGAGCCG AAGCTAGCTC CATCAGATGC GTGCTGCGTC GTGTGGCATA 24 0 

AGGCGAACAT CCCATGCCTT TGCGCTGGTG TCACCAAGGA GAAAGAGAAG ATATGGAGCA 300 

TGGAGAAGGT TGGCTACGTT GCCAATTTCT GCAAGAAGCC GTTCCCACAT GGCTACAACT 360 

GTGGAAGTTA CACATTCCCT CCTCTAGCGT AGTGACTAAT TTATCAGCGG GAGAAGCGTC 420 

GGCTTTTACA TTCCATGTTG CTGGCCTCAC AACTU^CTTTC CTTTGAGATA TGTAATTTGT 480 

GAGGTTGTGC TAAAATAAGT TTATGTTAAC CATTATGTGA GAACGTTTGT TATAATAAAG 54 0 

TCTTCTCCTC AAATTGTCTA AAAAAAAAAA AAAAAAAAAA A 581 
(2) INrORMATION FOR SCQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



CTTCCCTTCG 


CCAGCACGAG 


CAAGGAGATG 


GAGGGGAAGA 


AGAGAGCGGC 


GGCCATTGCC 


60 


GCCCTGTGCA 


TGGTCCTACT 


CCTCATGCAG 


GCAAGGCCAT 


CCCATCAGCA 


GTTCTCCGAC 


120 


TACGCCTGCG 


AGTGCATCCG 


GCAGTGCTAC 


CCCGCGTGCA 


GGGACAGCAC 


CCCGCCGTGG 


180 


CTCTGCAAGA 


TCAAGTGCoC 


CGGTAGCTGC 


CACAACGGCG 


ACAGGAAGGA 


CGCACTTACC 


240 


GCCTGCAGGA 


TCGCCTGCCT 


CACGAGCCCC 


GTCTGCGGCC 


TGTCGACACC 


GCCCGTCGCT 


300 


CCAGGTGATG 


TTGATCCTTG 


TACCAGGGAG 


TGCGACAAGC 


TGTGGGGTGG 


CCATGGTCAC 


360 


GCCAAGGAAC 


CTTGAAGAGC 


GACGAGCCGG 


TCGATCCGGG 


ATCACATGAT 


GTCGAATGGC 


420 


GATGATTGCT 


GCACCAATCA 


ATAAAATAAA 


ATGTATTTCG 


CTTCGGTGCA 


TCACTTGTGA 


480 


GAAAGTTTGT 


GGGTCTTAGA 


AGTAAAAGTA 


AAGATGGTTA TCGACGCATT 


TGGCGGAAAA 


540 


AAAAAAAAAA AAAAAAA 










557 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

His Gin Arg His Arg Asp Gly Ser Ser Val Ala Asp Thr Lys Ala lie 
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1 5 10 15 

Ala Met Ala Lys Leu Met Cys Leu Cys Phe He He Leu Thr He Ala 
20 25 30 

Val Val Val Ser Ala Gly Gly Cys Asp Gly Asp Arg Glu Asp Met lie 
35 40 45 

Arg Glu Cys Gly Lys Tyr Gin Lys Phe Pro Ala Glu Pro Lys Leu Ala 
10 50 55 60 

Pro Ser Asp Ala Cys Cys Val Val Trp His Lys Ala Asn He Pro Cys 
€5 70 75 80 

15 Leu Cys Ala Gly Val Thr Lys Glu Lys Glu Lys He Trp Ser Met Glu 

B5 • 90 95 

Lys Val Gly Tyr Val Ala Asn Phe Cys Lys Lys Pro Phe Pro His Gly 
100 105 110 

20 

Tyr Asn Cys Gly Ser Tyr Thr Phe Pro Pro Leu Ala 
115 120 

25 (2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

35 

Leu Pro Phe Ala Ser Thr Ser Lys Glu Met Glu Gly Lys Lys Arg Ala 
15 10 15 

Ala Ala He Ala Ala Leu Cys Met Val Leu Leu Leu Met Gin Ala Arg 
40 20 25 30 

Pro Ser His Gin Gin Phe Ser Asp Tyr Ala Cys Glu Cys He Arg Gin 
35 40 45 

45 Cys Tyr Pro Ala Cys Arg Asp Ser Thr Pro Pro Trp Leu Cys Lys He 
50 55 60 

Lys Cys Ala Gly Ser Cys His Asn Gly Asp Arg Lys Asp Ala Leu Thr 
65 70 75 80 

50 

Ala Cys Arg He Ala Cys Leu Thr Ser Pro Val Cys Gly Leu Ser Thr 
85 90 95 

Pro Pro Val Ala Pro Gly Asp Val Asp Pro Cys Thr Arg Glu Cys Asp 
55 100 105 110 

Lys Leu Trp Gly Gly His Gly His Ala Lys Glu Pro 
115 120 

60 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Phe Pro Ser Pro Ala Arg Ala Arg Arg Trp Arg Gly Arg Arg Glu Arg 
10 1 5 10 15 

Arg Pro Leu Pro Pro Cys Ala Trp Ser Tyr Ser Ser Cys Arg Gin Gly 
20 25 30 

15 His Pro lie Ser Ser Ser Pro Thr Thr Pro Ala Ser Ala Ser Gly Ser 
35 40 45 

Ala Thr Pro Arg Ala Gly Thr Ala Pro Arg Arg Gly Ser Ala Arg Ser 

20 '° " ^° 

Ser Ala Pro Val Ala Ala Thr Thr Ala Thr Gly Arg Thr His Leu Pro 
65 70 75 80 

Pro Ala Gly Ser Pro Ala Ser Arg Ala Pro Ser Ala Ala Cys Arg His 
25 85 90 95 

Arg Pro Ser Leu Gin Val Met Leu lie Leu Val Pro Gly Ser Ala Thr 
100 105 110 

30 Ser Cys Gly Val Ala Met Val Thr Pro Arg Asn Leu Glu Glu Arg Arq 
115 120 125 



35 



Ala Gly Arg Ser Gly He Thr 
130 135 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



45 



50 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Ser Leu Arg Gin His Glu Gin Gly Asp Gly Gly Glu Glu Glu Ser Gly 
15 10 15 

Gly His Cys Arg Pro Val His Gly Pro Thr Pro His Ala Gly Lys Ala 
20 25 30 



He Pro Ser Ala Val Leu Arg Leu Arg Leu Arg Val His Pro Ala Val 
55 35 40 45 

Leu Pro Arg Val Gin Gly Gin His Pro Ala Val Ala Leu Gin Asp Gin 
50 55 60 

60 Val Arg Arg 
65 
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Claims 

We claim: 

1 . An isolated nucleic acid molecule, comprising an ENDl promoter. 

2. An isolated nucleic acid molecule, comprising a NUCl promoter. 

3 . An isolated nucleic acid molecule, encoding a product of END 1 . 

4. An isolated nucleic acid molecule, encoding a product of NUCl . 

5. A vector, comprising a nucleic acid molecule of claims 1 or 2. 

6. A vector, comprising a nucleic acid molecule of claims 3 or 4. 

7. The vector according to either claims 5 or 6 wherein said vector is an 
expression vector. 

8. The vector of claim 7 wherein the vector is a binary Agrobacterium 
tumefaciens plasmid vector. 

9. The vector according to claim 5, further comprising a nucleic sequence 
encoding a foreign gene operably linked to said promoter. 

10. The vector according to either claims 5 or 6, further comprising a 
selectable marker. 



1 1 . The vector according to claim 9 wherein said foreign gene confers 
disease resistance. 
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12. A host cell containing a vector according to any one of claims 1-11. 

13. The host cell of claim 12 Mdierein the host cell is a plant cell. 

14. The host cell of claim 13 wherein said plant cell is selected fiom the 
group consisting of barley, com, sunflower and soybean. 

15. A method of producing a foreign gene product, comprising: 

(a) introducing a vector according to claim 9 into a host cell, 
wherein the vector contains a foreign gene in an expressible form; and 

(b) growing the host cell under conditions wherein the foreign gene 

is expressed. 

16. The method of claim 15 \iiierein said host cell is a plant cell. 

17. The method of claim 16 wherein said plant cell is selected from the 
group consisting of barley, com, sunflower and soybean. 

18. The method of claim 17 wherein transfection is by Agrobacterium co- 
cultivation of bombardment. 

19. A method of producing a plant that expresses a foreign gene, 

comprising: 

(a) introducing a vector according to claim 9 into an embryogenic plant 
cell, wherein said vector contains a foreign gene in an expressible form; and 

(b) producing a plant from said embryogenic plant cell, wherein said plant 
expresses said foreign gene. 



20. The method of claim 19 wherein the step of introducing is by 
Agrobacterium co-cultivation or bombardment. 
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FIG. 3 
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